|
Bilingual collaborative Chinese relation extraction based on parallel corpus
GUO Bo, FENG Xupeng, LIU Lijun, HUANG Qingsong
Journal of Computer Applications
2017, 37 (4):
1051-1055.
DOI: 10.11772/j.issn.1001-9081.2017.04.1051
In the relation extraction of Chinese resources, the long Chinese sentence style is complex, the syntactic feature extraction is very difficult, and its accuracy is low. A bilingual cooperative relation extraction method based on a parallel corpus was proposed to resolve these above problems. In a Chinese and English bilingual parallel corpus, the English relation extraction classification was trained by dependency syntactic features which obtained by mature syntax analytic tools of English, the Chinese relation extraction classification was trained by n-gram feature which is suitable for Chinese, then they constituted bilingual view. Finally, based on the annotated and mapped parallel corpus, the training corpus with high reliability of both classifications were added to each other for bilingual collaborative training, and a Chinese relation extraction classification model with better performance was acquired. Experimental results on Chinese test corpus show that the proposed method improves the performance of Chinese relation extraction method based on weak supervision, its
F value is increased by 3.9 percentage points.
Reference |
Related Articles |
Metrics
|
|